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ABSTRACT Single-stranded DNA-binding proteins 
(SSBs) play essential roles in DNA replication, recombination, 
and repair in bacteria and eukarya. We report here the 
identification and characterization of the SSB of an archaeon, 
Methanococcus jannaschii. The M. jannaschii SSB (mjaSSB) 
has significant amino acid sequence similarity to the eukary- 
otic SSB, replication protein A (RPA), and contains four 
tandem repeats of the core single-stranded DNA (ssDNA) 
binding domain originally defined by structural studies of 
RPA. Homologous SSBs are encoded by the genomes of other 
archaeal species, including Methanobacterium thermoautotro- 
phicum andArchaeoglobus fulgidus. The purified mjaSSB binds 
to ssDNA with high affinity and selectivity. The apparent 
association constant for binding to ssDNA is similar to that of 
RPA under comparable experimental conditions, and the 
affinity for ssDNA exceeds that for double-stranded DNA by 
at least two orders of magnitude. The binding site size for 
mjaSSB is «*20 nucleotides. Given that RPA is related to 
mjaSSB at the sequence level and to Escherichia coli SSB at the 
structural level, we conclude that the SSBs of archaea, eu- 
karya, and bacteria share a common core ssDNA-binding 
domain. This ssDNA-binding domain was presumably present 
in the common ancestor to all three major branches of life. 



Single-stranded DNA-binding proteins (SSBs) play essential 
roles in most intracellular transactions involving DNA, includ- 
ing replication, recombination, and repair (1, 2). Members of 
this class of proteins have been identified in bacterial and 
eukaryotic cells and also are encoded by mitochondrial and 
viral genomes. Although there is little detectable amino acid 
sequence similarity between the bacterial and eukaryotic 
SSBs, recent evidence suggests these proteins have striking 
similarities at the structural level and likely are derived from 
a common ancestor (3-5). No SSB from the third major 
phylogenetic division, the archaea, has been identified, al- 
though recently published sequences of archaeal genomes 
contain genes whose predicted products have some sequence 
similarity to replication protein A (RPA), the eukaryotic SSB 
(6). 

RPA originally was purified as a protein essential for SV40 
DNA replication in vitro (7-9). Subsequent studies have shown 
that it is present in cells of all eukaryotic species from yeast to 
man (1). RPA is a heterotrimer composed of subunits of 70, 32, 
and 14 kDa. The central 250 amino acids of the large subunit 
(RPA70) contain two tandem single-stranded DNA-binding 
domains (3, 10-12). The N terminal region of RPA70 mediates 
interactions with a variety of cellular proteins, including other 
replication proteins and certain transcription factors (reviewed 
in ref. 1). The C terminal region of RPA70 interacts with 
RPA32 and is required for assembly of the trimeric complex 
(10-13). The middle subunit of RPA (RPA32) contains an- 



The publication costs of this article were defrayed in part by page charge 
payment. This article must therefore be hereby marked "advertisement" in 
accordance with 18 U.S.C. §1734 solely to indicate this fact. 

© 1998 by The National Academy of Sciences 0027-8424/98/95 14634-6S2.00/0 
PNAS is available online at www.pnas.org. 



additional single-stranded DNA-binding domain that is likely 
to be similar in structure to those present, in RPA70 (14, 15). 
It has been suggested on the basis of sequence analysis that the 
small subunit (RPA14) also may contain a single-stranded 
DNA-binding domain, but this has not been verified experi- 
mentally (14). 

A crystal structure at 2.4 A of the two DNA-binding 
domains of RPA70 bound to a single-stranded octanucleotide 
has been reported (3). Each domain is composed of an OB 
(oligonucleotide/oligosaccharide binding) fold (16) and con- 
tains a channel that binds three nucleotides in the DNA chain. 
The DNA-binding channels of the two domains are oriented so 
that the single-stranded DNA (ssDNA) extends from one 
domain to the other in roughly a straight line, with two 
nucleotides bridging the space between. Both domains interact 
with the DNA via hydrogen bonds to the bases and phosphate 
backbone, as well as via stacking interactions between con- 
served aromatic residues and the bases. Although RPA is the 
only SSB for which the structure of the protein-DNA complex 
is known, the structures of Escherichia coli SSB and human 
mitochondrial SSB in the absence of DNA have significant 
structural similarity to RPA (3-5, 17). 

Here, we report the identification and initial characteriza- 
tion of a SSB from the archaeon Methanococcus jannaschii. We 
initially identified a putative homologue of RPA70 in the 
published sequence of the complete M. jannaschii genome (18) 
and found closely related sequences in the subsequently pub- 
lished genomes of Methanobacterium thermoautotrophicum 
and Archaeoglobus fulgidus (6, 19). A similar sequence analysis 
was performed independently by Chedin et al (20). To deter- 
mine whether these proteins represent authentic achaeal SSBs, 
we expressed the putative M. jannaschii SSB in E. coli and 
purified it to homogeneity. The purified protein possesses 
single-stranded DNA-binding activity, and its interactions with 
oligonucleotides are similar in many respects to those of RPA. 

MATERIALS AND METHODS 

Expression and Purification of M \ jannaschii SSB (mjaSSB). 

Clone AMJF35, containing ORF MJ1159 of M. jannaschii, was 
received from the American Type Culture Collection. The 
MJ1159 ORF was amplified by PCR and was inserted between 
the BamHl and Hindlll sites of the bacterial expression vector 
pET-28b (Novagen). Transformants of E. coli BL21(DE3) 
were grown in Luria-Bertani medium and were induced by 
incubation in the presence of IPTG for 2.5 nr. Bacterial lysates 
were prepared by sonication in S buffer (50 mM NaH2PO<i, pH 
8.0/300 mM NaCl/10 mM imidazole) plus 2>g/ml leupeptin, 

1 mM phenylmethylsulfonyl fluoride, 1 /xg/ml pepstatin, and 

2 jxg/ml aprotinin. After clarification by centrifugation for 20 
min at 13,000 rpm in an Eppendorf centrifuge at 4°C, the lysate 



Abbreviations: SSB, single-stranded DNA-binding protein; RPA, rep- 
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coccus jannaschii SSB; mthSSB, Methanobacterium thermoautotrophi- 
cum SSB. 
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from 50-100 ml of culture was mixed with 1-2 ml of Ni-NTA 
matrix (Qiagen, Chatsworth, CA) for 60 min at 4°C with gentle 
shaking. The mixture then was poured into a column, was 
washed with S buffer containing 20 mM imidazole, and was 
eluted with S buffer containing 250 mM imidazole. The eluate 
was loaded onto a 1- to 2-ml column of ssDNA agarose 
(GIBCO/BRL), which was washed with buffer A (25 mM 
Tris-HCl, pH 7.8/1 mM EDTA/10% glycerol/0.01% Tween 
20/1 mM DTT/1 mM phenylmethylsulfonyl fluoride) con- 
taining 1M NaCl. The protein was eluted from the column with 
buffer A containing 2 M NaCl/40% ethylene glycol and was 
dialyzed against buffer A. Protein concentrations were deter- 
mined by Bradford assay with BSA as standard (21). 

Estimation of Native Molecular Weight. The Stokes radius 
of mjaSSB was determined by gel filtration of the purified 
protein at 4°C on a Superdex 200 HR 10/30 FPLC column 
(Pharmacia) equilibrated with buffer A (without phenylmeth- 
ylsulfonyl fluoride) containing 0.5 M NaCl. A set of globular 
protein standards was run separately for reference. The sed- 
imentation coefficient of mjaSSB was determined by density 
gradient ultracentrifugation of the purified protein mixed with 
protein standards. The conditions were identical to those used 
for gel filtration with the exception of the 10-30% glycerol 
gradient. The native molecular weight of mjaSSB was calcu- 
lated as described in Siegel and Monty (22). 

Gel Mobility Shift Assays. Oligonucleotides were obtained 
from Integrated DNA Technologies (Coralville, IA) and were 
end-labeled by using [•y 32 P]ATP (3,000Ci/mmol) and T4 
polynucleotide kinase. Binding reactions were incubated for 45 
min at 25°C in HI15 buffer (30 mM Hepes, pH 7.8/1 mM 
DTT/0.25 mM EDTA/0.5% inositol/15 mM KC1), and the 



reaction products were resolved by electrophoresis in 1% 
agarose gels as described (23, 24). 

Fluorescence Titrations. Binding reactions were assembled 
in 2 ml of FB buffer (30 mM Hepes, pH 7.8/100 mM NaCl/5 
mM MgCl 2 /0.5% inositol/1 mM DTT) and were incubated at 
either 25 or 50°C. A constant amount of mjaSSB was incubated 
with varying quantities of poly(dT) (Midland Certified Re- 
agent, Midland, TX). Fluorescence measurements were made 
on a Jpbin-Yvon (Longjumeau, France) Spex FluoroMax-2 
spectrof luorometer as described for RPA (23). The excitation 
wavelength was 295 nm (bandpass 2.5 nm), and the emission 
wavelength was 348 nm (bandpass 1 nm). 

RESULTS 

Putative Homologues of RPA in Archaea. The publication of 
the first complete sequence of an archaeal genome, M. jann- 
aschii, made it possible to identify candidate DNA replication 
proteins by sequence similarity searches (18). Although a 
number of such proteins readily were identified, no homologue 
of eukaryotic or bacterial SSBs was reported in the initial 
publication. Given that most M. jannaschii replication proteins 
appear to be related more closely to their eukaryotic coun- 
terparts (see Discussion), we sought to identify putative ho- 
mologues of human RPA70 (hRPA70). Surveys of the entire 
M. jannaschii genome with the program fasta (25) revealed an 
ORF (MJ1159) with low, but perceptible, sequence similarity 
to hRPA70. More detailed analysis demonstrated that ORF 
MJ1159 contains four closely related tandem repeats of a 
sequence encoding ^100 amino acids. Strikingly, each of the 
repeats has significant sequence similarity to the first single- 
stranded DNA-binding domain of hRPA70 (Fig. 1). In many 
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Fig. 1. Alignment of the sequences of the ssDNA-binding domains of M. jannaschii and M. thermoautotrophicum SSBs. The four repeats of 
mjaSSB and the five repeats of mthSSB are numbered sequentially. Identical residues are shaded black, and conserved residues are shaded gray. 
The sequence of the first ssDNA-binding domain of human RPA70 is shown below the consensus for the archaeal sequences. The shaded residues 
in the RPA sequence are similar or identical to those of the archaeal consensus. The boxed residues are those that make contact with the ssDNA 
in the crystal structure of hRPA70 (3). Arrows indicate the positions of the 0 sheets in the conserved OB (oligonucleotide/oligosaccharide binding) 
fold of hRPA70, and the gray bar indicates the position of an a helix. Aromatic residues corresponding to the conserved phenylalanine in the in 
the loop between sheets 4' and 5' of RPA70 are circled. 
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cases, the residues conserved among the four repeats of ORF 
MJ1159 correspond to those conserved in RPA70 subunits 
from yeast to man. In particular, most of the amino acids 
known to contact ssDNA in domain A of human RPA70 (3), 
including the aromatic residues involved in stacking interac- 
tions with the bases, are conserved in the individual repeats of 
the archaeal protein (Fig. 1). Consistent with these sequence 
similarities, the predicted secondary structure of the repeats in 
ORF MJ1159 (26) is compatible with the OB (oligonucleo- 
tide/oligosaccharide binding) fold observed in the two tandem 
DNA-binding domains of RPA70. These findings strongly 
suggest that the M. jannaschii SSB (mjaSSB) consists of four 
tandem repeats of a common core ssDNA-binding domain that 
is structurally similar to that of RPA70. It follows that that the 
eukaryotic and archaeal SSBs likely evolved from a common 
ancestor. 

After the analysis described above was completed, the 
complete sequences of two additional archaeal genomes were 
published (6, 19). Analysis of these genomes revealed that both 
contained homologues of mjaSSB. The genome of M. thermo- 
autotrophicum contains an ORF [MT1185/M thermoautotro- 
phicum SSB (mthSSB)] with five tandem repeats of the core 
ssDNA-binding domain. Fig. 1 shows the alignment of the nine 
repeats present in mthSSB and mjaSSB with the sequence of 
the ssDNA-binding domain of hRPA70. Of interest, the A. 
fulgidus genome encodes two ORFS (AF0382 and AF0780), 
each containing two putative ssDNA-binding domains in 
tandem. This finding may have some evolutionary significance 
because, as noted above, both the 70-kDa and 32-kDa subunits 
of hRPA appear to contain core ssDNA-binding domains. 
Thus, eukaryotes and at least one archaeal species have 
evolved two separate proteins with presumptive ssDNA- 
binding activity. It will be of interest to determine whether the 
two A. fulgidus SSBs exhibit functional differences. 

Expression and Purification of the RPA Homologue from M. 
jannaschii. To determine whether the ORFs identified by 
sequence comparison truly represent SSBs, we purified and 
characterized the candidate SSB from M. jannaschii. The 
MJ1159 ORF was inserted into the pET-28 bacterial expres- 
sion vector and was expressed as a fusion protein containing 
an N-terminal his 6 tag. The tagged mjaSSB protein was largely 
soluble and was purified to near homogeneity by affinity 
chromatography on Ni-chelate agarose and ssDNA-agarose. 
Analysis of the purified protein by SDS/PAGE revealed a 
single major band with a molecular mass of 80 kDa, in 
agreement with the predicted molecular mass of 77 kDa (data 
not shown). Analytical gel filtration and sedimentation studies 
demonstrated that the native mjaSSB has a Stokes radius of 42 
A and a sedimentation coefficient of 4.4 S. The molecular mass 
of mjaSSB calculated from these values is 76 kDa, indicating 
that the protein is a monomer in solution. 

DNA-Binding Properties of mjaSSB. During the purifica- 
tion of the putative mjaSSB, we noted that a high electrolyte 
concentration in the presence of a chaotropic agent was required 
to elute the protein from ssDNA agarose. No protein eluted from 
the column at 1M NaCl, but quantitative recovery was obtained 
by elution with 2 M NaCl/40% ethylene glycol. To confirm the 
ability of mjaSSB to bind to ssDNA, we carried out agarose gel 
mobility shift assays with radioactively labeled single-stranded 
dT-oIigonucleotides 30, 50, and 70 nucleotides in length (Fig. 2). 
When (dT) 30 was incubated with increasing concentrations of 
mjaSSB, a single band of reduced mobility was observed. Most of 
the (dT) 30 was shifted after addition of one molar equivalent of 
mjaSSB, and the mobility of the shifted band remained constant 
at higher protein concentrations, A band of identical mobility was 
observed for (dT) 50 at low oligonucleotide concentrations, but a 
second shifted band with a lower mobility was observed at high 
protein concentrations. In the case of (dT) 70 , a third shifted band 
appeared at the highest protein concentrations. Assuming that 
mjaSSB is a monomer, the simplest interpretation of these data 
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Fig. 2. Binding of mjaSSB to single-stranded oligonucleotides, A 
fixed quantity of [5'- 32 P]oligonucleotide (38 fmol of dT 30) 32 fmol of 
dT 50) and 32 fmol of dT 7 o) was incubated with 0, 5, 20, 50, and 200 fmol 
of mjaSSB in 20-fd reaction mixtures at 25°C Protein-DNA complexes 
were separated from free DNA by agarose gel electrophoresis. 

is that a single protein molecule binds to (dT)3o whereas two and 
three protein molecules, respectively, bind to (dT) 50 and (dT) 70 - 
Under similar experimental conditions, a single molecule of 
hRPA binds to (dT) 30 and (dT) 50 whereas two molecules of hRPA 
bind to (dT) 70 (23, 24). Thus, the binding site size for mjaSSB may 
be somewhat smaller than that of hRPA. 

To determine the apparent affinity of mjaSSB for ssDNA, 
radioactively labeled (dT) 30 was incubated with increasing 
quantities of protein, and the fraction of DNA bound was 
measured by agarose gel mobility shift assay as before (Fig. 3). 
The apparent binding constant of mjaSSB for (dT) 3 o was on the 
order of 2 X 10 10 M _1 . The apparent affinity of hRPA for 
ssDNA measured by the same experimental method is «1 x 
10 10 M" 1 (23, 24, 27). 

To compare the affinity of mjaSSB for single- and double- 
stranded DNA, we performed binding competition studies. 
Reaction mixtures containing fixed amounts of mjaSSB and 
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Fig. 3. Binding isotherm for mjaSSB with (dT) 30 . Binding reaction 
mixtures (20 contained 0.03 fmol of [5'- 32 P]dT 30 and the indicated 
concentration of mjaSSB. Protein-DNA complexes were separated 
from free DNA by agarose gel electrophoresis, and the radioactivity 
in each species was determined by using a phosphorimager. The 
apparent Ka is 2 X 10 10 M -1 . 
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competitor DNA (M) 

Fig. 4. Relative affinity of mjaSSB for single- and double-stranded 
DNA. Reaction mixtures (20 /il) contained a fixed quantity of mjaSSB 
(1.75nM) and [ 32 P]dT 30 (0.05nM). Before incubation, the [ 32 P]dT 30 
was mixed with single- or double-stranded Ml 3 DNA to the indicated 
final concentration. Protein-DNA complexes were separated from 
free DNA by agarose gel electrophoresis, and the fraction of DNA 
bound to protein was estimated by quantification of digitized images 
of scanned autoradiograms. 

radioactive (dT) 30 were incubated with increasing concentra- 
tions of unlabeled single- or double-stranded M13 DNA (Fig. 
4). In the absence of competitor, the concentration of mjaSSB 
was sufficient to drive most of the (dT) 30 into protein-DNA 
complexes. Approximately lOOx as much double-stranded 
competitor DNA as single-stranded competitor DNA was 
required to reduce the fraction of (dT) 3 n in protein-DNA 
complexes to 50%. These data indicate that the apparent 
affinity of mjaSSB for ssDNA is at least 100X that for 
double-stranded DNA. This estimate probably represents a 
lower limit because preparations of double-stranded M13 
DNA likely contain low levels of contamination with ssDNA. 

Binding Site Size of mjaSSB. To further explore the binding 
properties of mjaSSB, we made use of fluorescence spectros- 
copy. The mjaSSB protein contains six tryptophan residues, 
and, based on the alignment of the mjaSSB repeats with the 
sequence of RPA, five of the six residues are likely to be located 
in DNA binding sites (see Fig. 1). We observed that the 
intrinsic fluorescence of mjaSSB, like that of RPA, is quenched 
on DNA binding (23). With an excitation wavelength of 295 
nm, the emission spectrum of mjaSSB at 25°C had a maximum 
at 348 nm, consistent with tryptophan fluorescence. On addi- 
tion of a saturating quantity of ssDNA, the intrinsic fluores- 
cence at 348 nm was reduced (quenched) by «*60% (Fig. 5). 
DNA-dependent quenching also was observed at 50°C (data 
not shown), indicating that mjaSSB is active in DNA binding 
at high temperature. 

The occluded binding site size of a SSB is the length of DNA 
rendered inaccessible when the protein is bound. To estimate 
this binding parameter for mjaSSB, we used fluorescence 
measurements to follow binding of the protein to DNA under 
stoichiometric binding conditions (Fig. 5). When a fixed 
quantity of mjaSSB was titrated with poly(dT), the quenching 
of mjaSSB fluorescence increased linearly with DNA concen- 
tration until saturation was reached (Fig. 5). At saturation, the 
molar ratio of nucleotide to protein was 21. This binding site 
size is somewhat smaller than that of RPA, which was esti- 
mated to be 30 nucleotides by similar methodology (23). 

We also studied the interaction of mjaSSB with oligonucle- 
otides of different lengths by using gel mobility shift assays. 
Fig. 6 shows the results of an experiment in which a fixed 
quantity of mjaSSB was incubated with radioactively labeled 
(dT) n oligonucleotides ranging in length from 10 to 30 nucle- 
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Fig. 5. Binding site size of mjaSSB. Reaction mixtures (2 ml) 
contained a fixed amount of mjaSSB (0.4 nmol). The indicated 
amounts of poly(dT) were added, and the fluorescence, at 348 nm was 
measured at 25°C with an excitation wavelength of 295 nm. The 
percent quenching is the change in mjaSSB fluorescence normalized 
to the initial fluorescence in the absence of poly(dT). 



otides. The molar concentration of oligonucleotide added to 
each reaction mixture was the same, so that the relative amount 
of protein-oligonucleotide complex formed in each case rep- 
resented a measure (albeit qualitative) of the relative affinity 
of binding to mjaSSB. Under the conditions of this experiment, 
about three-fourths of the (dT) 30 was shifted to the position of 
protein-DNA complexes. A slightly smaller amount of pro- 
tein-DNA complexes was observed for (dT)2o- For oligonu- 
cleotides with lengths <20, the binding of mjaSSB showed a 
strong dependence on length, and complexes between (dT)io 
and mjaSSB were not detectable at all (although such com- 
plexes could be observed at higher protein concentrations). 
These data are consistent with a binding site size of 15-20 
nucleotides for mjaSSB. 
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Fig. 6. Interaction of mjaSSB with oligonucleotides of different 
lengths. Reaction mixtures (20 /xl) contained 1 fmol of oligonucleotide 
of the indicated length and 5 fmol of mjaSSB. After incubation at 25°C 
for 30 min, protein-DNA complexes were separated from free DNA 
by agarose gel electrophoresis. The fraction of DNA bound to protein 
was estimated by quantitation of digitized images of scanned autora- 
diograms. 



14638 Biochemistry: Kelly et ai 



Proc. Natl Acad. ScL USA 95.(1998) 



DISCUSSION 

When the first complete sequence of an archaeal genome was 
published, it became apparent that the predicted amino acid 
sequences of archaeal DNA replication proteins more closely 
resemble those of eukaryotes than bacteria (Table 1) (18, 28). 
This is consistent with a large body of evidence indicating that 
the divergence of archaea and eukaryotes is more recent than 
the divergence of bacteria and eukaryotes/archaea (29). Ar- 
chaea and eukaryotes contain some replication proteins for 
which no bacterial homologue is yet evident. This is especially 
striking for proteins thought to be involved in initiation of 
DNA replication in eukaryotic cells, such as members of the 
Orel /Cdcl8/Cdc6 and MCM families of initiation proteins. In 
other cases, it is possible to identify a likely bacterial homo- 
logue whose amino acid sequence is distantly related to the 
corresponding eukaryotic/archaeal replication protein. For 
example, there is weak, but significant, primary sequence 
similarity between the proteins of bacteria and eukarya/ 
archaea that are responsible for loading processivity factors at 
the replication fork (''clamp-loading complexes"). The simi- 
larity of both sequence and function suggests that these 
proteins evolved from a common ancestor (28). Even in cases 
in which no amino acid sequence similarity can be detected 
between a eukaryotic/archaeal replication protein and its 
bacterial counterpart, a likely evolutionary relationship can be 
revealed by structural studies. Thus, the processivity factor of 
eukaryotes, PCNA, shows no sequence similarity to the j3 
clamp of E. coli, but the three dimensional structures of the two 
proteins are superimposible (30). 

Our data indicate that the archaeal SSB has detectable 
primary sequence similarity to eukaryotic RPA but not to £. 
coli SSB. Although it has been suggested that that E. coli SSB 
and RPA share amino acid sequence similarity (14), the 
proposed sequence alignment is very weak and is inconsistent 
with the available structural data (see below). Thus, on the 
basis of sequence data alone, one cannot conclude that the 
archaeal/eukaryotic SSBs and the bacterial SSBs have a 
common origin. However, comparison of the three- 
dimensional structures of the ssDNA-binding domains of RPA 
and E. coli SSB makes a compelling case for an evolutionary 
relationship between the two proteins. The alignment of the 
first ssDNA-binding domain of RPA with E. coli SSB reveals 
a rms deviation of 2.2 A for the equivalent C-a atoms over a 
length of 72 amino acid residues (Biomolecule 3D Structure 
data base, National Center for Biotechnology Information; see 
ref. 31). The probability of this structural similarity arising by 
chance is 10" 5 . Thus, it is likely that the SSBs of modern 
organisms share a single core ssDNA-binding domain that was 



already present in the common ancestor to bacteria, archaea, 
and eukarya. We suggest that most, if not all, of the protein 
functional units at the replication fork are of similarly ancient 
origin. With respect to the proteins specifically involved in 
initiation, it is not yet clear whether novel functional units 
evolved in the lineage that gave rise to the archaea and eukarya 
or whether there is simply insufficient data to allow recogni- 
tion of possible bacterial homologues. 

The purified mjaSSB has high affinity and selectivity for 
ssDNA. The apparent association constant' is similar to that of 
RPA under comparable experimental conditions. Although we 
have not studied the temperature dependence of DNA binding 
by mjaSSB in detail, our fluorescence quenching data indicate 
that the protein binds ssDNA at high temperatures. M jann- 
aschii grows at an optimum temperature of 85°C at pressures 
up to 200 atrri (18). Thus, it will be of considerable interest to 
explore further the interaction of mjaSSB with oligonucleo- 
tides under more extreme conditions. 

The occluded binding site size of mja{5SB is in the neigh- 
borhood of 20 nucleotides. This value is consistent with 
expectations based on analysis of the structure of human RPA. 
The ssDNA-binding channel of each domain of hRPA70 
accommodates three nucleotides, and the space between the 
two domains is occupied by two nucleotides (3). Because 
mjaSSB has four tandem ssDNA-binding domains, an oc- 
cluded binding site size of at x least 12-20 nucleotides is 
predicted. In addition to the four DNA-binding domains, 
mjaSSB, like other archaeal SSBs, possesses a putative zinc 
finger motif near the C terminus (20). RPA70 contains a 
similar motif, but genetic studies indicate that it is not required 
for ssDNA binding (1). 

It is interesting that mjaSSB contains four tandem repeats 
of the conserved core ssDNA-binding domain. The homo- 
logues of mjaSSB in other archaeal species ajso are organized 
as tandem repeats. The SSB of.M thermoautotrophicum con- 
tains five repeats of the core ssDNA-binding domain. The A. 
fulgidus genome encodes two different SSBs, each of which 
contains two such repeats. As noted above, it has been 
suggested on the basis of sequence comparisons that the RPA 
may contain as many as four core ssDNA-binding domains 
distributed over the three subunits (14). Three of these have 
been verified by biochemical studies (1, 15). The E. coli SSB 
contains only a single core ssDNA-binding domain, but the 
protein functions as a homotetramer (2). Thus, the functional 
form of all SSBs appears to consist of several core ssDNA- 
binding domains, but these domains can be associated with one 
or more distinct polypeptide chains. This organization may 
impart considerable flexibility to the interactions of SSBs with 
ssDNA. Indeed, multiple modes of protein-DNA interaction 



Table 1. Homologues of eukaryotic DNA replication proteins 







Archaeal 


Bacterial 


Eukaryotic protein 


Functional role 


homologue 


homologue 


Orcl/Cdcl8/Cdc6 


Initiation 


Yes 


? 


MCM complex 


Initiation 


Yes 


? 


RPA (ssDNA binding) 


Initiation/elongation 


Yes 


Yes 


Primase 


Initiation/elongation 


Yes 


? 


DNA polymerase (B family) 


Elongation 


Yes 


Yes 


RFC (clamp loader) 


Elongation , 


Yes 


Yes 


PCNA 


Elongation 


Yes 


Yes 


DNA ligase 


Elongation 


Yes 


Yes 


FENI 


Elongation 


Yes 


? 


Ribonuclease H 


Elongation 


Yes 


Yes 


Dna2 helicase 


Elongation 


Yes 


? 


Topoisom erase I 


Elongation 


Yes 


Yes 



Entries in the table are derived in part from ref. 28 and in part from blast searches with the archaeal 
and/or eukaryotic replication protein as query. See also ref. 6, 18, and 19 and the web sites 
http://www.tigr.org/tdb/mdb/mjdb/mjdb.html, http://www.tigr.org/tdb/mdb/afdb/afdb.html, and 
http://www.biosci.ohio-state.edu/-genomes/mthermo/. 
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have been reported for both RPA and E. coli SSB (1, 2). The 
modular structure of SSBs also may function to promote very 
tight binding to ssDNA while at the same time allowing the 
rapid stepwise removal of the proteins from the DNA as must 
occur during DNA chain elongation on a ssDNA template. 

The mjaSSB protein appears to be considerably less complex 
than RPA70. In particular, it lacks the N- and C-terminal 
regions through which RPA70 interacts with a variety of other 
cellular proteins. The latter domains of RPA70 likely evolved 
after the divergence of archaea and eukarya as a response to 
the more complex regulatory requirements of replicating very 
large genomes with multiple origins of replication. In addition, 
the published genomes of archaeal species do not reveal 
sequences with similarity to RPA32 and RPA14 subunits of 
RPA, so it is possible that mjaSSB functions as a single 
polypeptide chain. Thus, mjaSSB may represent a simple 
model for studying the structure, function, and evolution of the 
conserved core ssDNA-binding domain. 
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